# 384 High Resolution
ViT Base Patch16 SigLIP 384.webli
Apache-2.0
A Vision Transformer based on SigLIP that contains only the image encoder and uses the original attention pooling mechanism.
Image Classification
Transformers

timm
64
1
DeiT Base Patch16 384
Apache-2.0
DeiT is an efficiently trained Vision Transformer model, pre-trained and fine-tuned on the ImageNet-1k dataset at 384x384 resolution, suitable for image classification tasks.
Image Classification
Transformers

facebook
442
3
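
Both models listed above pair a 16-pixel patch size with a 384x384 input. As a rough illustration of what the higher resolution costs, the sketch below counts the patch tokens a Vision Transformer would process at that setting, compared with the common 224x224 input. The helper name `patch_token_count` is hypothetical, and the count covers patch tokens only, ignoring any class or pooling tokens a particular model adds.

```python
def patch_token_count(image_size: int, patch_size: int) -> int:
    """Return the number of non-overlapping square patches in a square image.

    Hypothetical helper for illustration; it is not part of any library.
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    per_side = image_size // patch_size  # patches along one edge
    return per_side * per_side


# Patch16 at 384x384 (the models above) versus the common 224x224 input.
tokens_384 = patch_token_count(384, 16)
tokens_224 = patch_token_count(224, 16)
print(tokens_384, tokens_224)
```

The sequence length grows with the square of the resolution, which is why 384x384 variants are noticeably more expensive to run than their 224x224 counterparts.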